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Abstract 

We show that if a language is recognized within certain error bounds by constant- 
depth quantum circuits over a finite family of gates, then it is computable in (classical) 
polynomial time. In particular, our results imply 

EQNC C P, 

where EQNC is the constant-depth analog of the class EQP. 

On the other hand, we adapt and extend ideas of Terhal & DiVincenzo TD02] to 
show that, for any family T of quantum gates including Hadamard and CNOT gates, 
computing the acceptance probabilities of depth- five circuits over T is just as hard as 
computing these probabilities for arbitrary quantum circuits over T ' ■ In particular, this 
implies that 

NQNC = NQACC = NQP = coC=P, 

where NQNC is the constant-depth analog of the class NQP. This essentially refutes 
a conjecture of Green et al. that NQACC C TC° [GHMP02]. 



1 Introduction 



Quantum decoherence is a major obstacle to maintaining long quantum computations. The 
first working quantum computers will almost certainly be limited to realizing shallow — i.e., 
small-depth — quantum circuits. This dilemma has inspired much theoretical interest in the 
capabilities of these circuits, particularly circuits that have constant depth and polynomial 
size. 

Recently, people have found that much can be done with 0(logn)-depth circuits. For 
example, Cleve & Watrous were able to approximate the Quantum Fourier Transform over 
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modulus 2 n with 0(logn)-depth circuits [CWOOJ. Log-depth seems to present a barrier 
for many computational problems, however; getting significantly shallower circuits appears 
difficult if not impossible — unless gates of unbounded width (i.e., number of qubits, or fan-in) 
are allowed. This has led to the study of const ant- depth quantum circuits that can contain 
certain classes of unbounded fan-in gates. 

There are a number of unbounded- width gate classes studied in the literature, most being 
defined in analogy to classical Boolean gates. The generalized Toffoli gate (see Section |2~T|) is 
the quantum equivalent of the unbounded Boolean AND-gate. Likewise, there are quantum 
equivalents of Mod-gates and threshold gates. One particular quantum gate corresponds to 
something taken almost completely for granted in Boolean circuits — fan-out. A fan-out gate 
copies the (classical) value of a qubit to several other qubits at once. 1 Using these gates, one 
can define quantum versions of various classical circuit classes: QNC fc (Moore & Nilsson 
|MN()2j ). QAC fc and QACC fe (Moore |Moo99| . Green et al. |GHMP()2| ). and QTC fc are 
analogous to NC fc , AC fc , ACC, and TC fc , respectively. The case of particular interest is 
when k = 0. All these classes are allowed constant-width gates drawn from a finite family. 
The classes differ in the additional gates allowed. QNC° is the most restrictive class; all 
gates must have bounded width. QAC fc circuits are allowed generalized Toffoli gates, and 
QACC fc circuits are allowed Mod g -gates, where q is kept constant in each circuit family. 
QTC fc circuits are allowed quantum threshold gates. See Section |2~T1 for detailed definitions 
of most of these classes. 

Although quantum classes are defined analogously to Boolean classes, their properties 
have turned out to be quite different from their classical versions. A simple observation 
of Moore |Moo99j shows that the n-qubit fan-out gate and the n-qubit parity (Mod2) gate 
are equivalent up to constant depth, i.e., each can be simulated by a constant-depth circuit 
using the other. This is completely different from the classical case, where parity cannot 
be computed even with AC circuits, where fan-out is unrestricted |Ajt83| , IFSS 84J. Later, 
Green et al. showed that quantum Mod 9 -gates are constant-depth equivalent for all q > 1, 
and are thus all equivalent to fan-out. Thus, for any q > 1, 

QNCj = QACC°(g) = QACC . 

(The / subscript means, "with fan-out.") The classical analogs of these classes are provably 
different. In particular, classical Mod p and Mod g gates are not constant-depth equivalent if 
p and q are distinct primes, and neither can be simulated by AC circuits |Raz871 rSmo87j . 

Using QNC° circuits with unbounded fan-out gates, H0yer & Spalek managed to paral- 
lelize a sequence of commuting gates applied to the same qubits, and thus greatly reduced 



the depth of circuits for various purposes HS03 . They showed that threshold gates can be 



approximated in constant depth this way, and they can be computed exactly if Toffoli gates 
are also allowed. Thus QTC° = QACC as well. Threshold gates, and hence fanout gates, 
are quite powerful; many important arithmetic operations can be computed in constant 
depth with threshold gates [SBKH93J. This implies that the quantum Fourier transform — 
the quantum part of Shor's factoring algorithm — can be approxmated in constant depth 

1 There is no violation of the No-Cloning Theorem here; only the classical value is copied. 
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using fanout gates. 

All these results rely for their practicality on unbounded-width quantum gates being 
available, especially fan-out or some (any) Mod gate. Unfortunately, making such a gate in 
the lab remains a daunting prospect; it is hard enough just to fabricate a reliable CNOT gate. 
Much more likely in the short term is that only one- and two-qubit gates will be available, 
which brings us back to the now more interesting question of QNC°. How powerful is 
this class? Can QNC° circuits be simulated classically, say, by computing their acceptance 
probabilities either exactly or approximately? Is there anything that QNC° circuits can 
compute that cannot be computed in classical polynomial time? The present paper addresses 
these questions. 

A handful of hardness results about simulating constant-depth quantum circuits with 
constant-width gates were given recently by Terhal & DiVincenzo |TD02j. They showed 
that if one can classically efficiently simulate, via sampling, the acceptance probability of 
quantum circuits of depth at least three using one- and two-qubit gates, then BQP C AM. 
They also showed that the polynomial hierarchy collapses if one can efficiently compute the 
acceptance probability exactly for such circuits. (Actually, a much strong result follows from 
their proof, namely, P = PP.) Their technique uses an idea of Gottesman & Chuang for 
teleporting CNOT gates [GC99] to transform an arbitrary quantum circuit with CNOT and 
single-qubit gates into a depth-three circuit whose acceptance probability is proportional to, 
though exponentially smaller than, the original circuit. Their results, however, only hold on 
the supposition that depth-three circuits with arbitrary single-qubit and CNOT gates are 
simulatable. We build on their techniques, making improvements and simplifications. We 
weaken their hypothesis by showing how to produce a depth-three circuit with essentially 
the same gates as the original circuit. In addition, we can get by with only with simple qubit 
state teleportation [BBC+93 . Our results immediately show that the class NQNC (the 
constant-depth analog of NQP, see below), is actually the same as NQP, which is known 
to be as hard as the polynomial hierarchy FGHP99] . We give this result in Section 13.11 
It underscores yet another drastic difference between the quantum and classical case: while 
AC is well contained in P, QNC° circuits (even just depth-three) can have amazingly 
complex behavior. Our result is also tight; Terhal & DiVincenzo showed that the accep- 
tance probabilities of depth-two circuits over one- and two-qubit gates are computable in 
polynomial time. 

In Section 13. 2\ we give contrasting upper bounds for QNC°-related language classes. 
We show that various bounded-error versions of QNC° (defined below) are contained in P. 
Particularly, EQNC C P, where EQNC is the constant-depth analog of the class EQP 
(see below). Our proof uses elementary probability theory, together with the fact that single 
output qubit measurement probabilities can be computed directly, and the fact that output 
qubits are "largely" independent of each other. In hindsight, it is not too surprising that 
EQNC C P. EQNC sets a severe limitation on the behavior of the circuit: it must accept 
with certainty or reject with certainty. This containment is more surprising (to us) for the 
bounded-error QNC° classes. 

We give open questions and suggestions for further research in Section HJ 
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2 Preliminaries 



2.1 Gates and circuits 

We assume prior knowledge of basic concepts in computational complexity: polynomial time, 
P, NP, as well as the counting class #P |Val79j . Information can be found, for example, 
in Papadimitriou |Pap94| . The class C^P (coC = P) was defined by Wagner |Wag86| . One 
way of defining C^P is as follows: a language L is in C^P iff there are two j^P functions 
/ and g such that, for all x, x G L •<=>- f(x) ^ g(x). C^P was shown to be hard for the 
polynomial hierarchy by Toda & Ogihara |T()92j . 

We will also assume some (but less) background in quantum computation and the quan- 
tum circuit model. See Nielsen and Chuang |NC00| for a good reference of basic concepts 



and notation. 

We review some standard quantum (unitary) gates. Among the single-qubit gates, we 
have the Pauli gates X, Y, and Z, the Hadamard gate H, and the ir/8 gate T, which are 
defined thus, for b G {0, 1}: 

X\b) = h&>, 

Y\b) = i(-l) b hb), 

Z\b) = (-l) b \b), 

H\b) = (|0) + (-l) b |l))/v / 2, 

T\b) = e inb/ *\b). 

For n > 1, the (n + l)-qubit generalized Toffoli gate T n satisfies 

n 

T n \xi, . . . , x n , b) = \xi, . . . , x n , b © y\ x^. 

i=i 

Here b is the target qubit and the control qubits. T n is a kind of multiply 

controlled X-gate (or NOT-gate), and is the quantum analog of the Boolean AND-gate with 
fanin n. T2 is known simply as the Toffoli gate. T\ is also known as the controlled NOT 
(CNOT) gate and is depicted below. Here, a, b G {0, 1}. 



a®b 

A gate closely related to T n is the controlled Z-g&te defined by 

Zn,\x± 7 ... , X n ) ( 1)^ 1— 1 \xi, • • • , X n ). 

Since HXH = Z, the gate Z n+ i-gate can be implemented by placing if-gates on either side 
of a T n gate on its target qubit. 
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The (n + l)-qubit fan-out gate F n is denned as follows: 

F„|xi, . . . ,x n ,b) = \xi © b, . . . ,x n © b,b). 

For g > 1, the {n + l)-qubit Mod^-gate acts on a basis state \x\, . . . ,x n , b) by flipping 
the target qubit b iff x\ + • • • + x n ^ (mod g). The control qubits xi, . . . , x n are left alone. 
The Mod2 gate is also known as the parity gate. The width of a gate is the number of qubits 
on which it acts. 

Our notion of quantum circuits is fairly standard (again see, for example, [NCOOJ): a 
series of quantum gates, drawn from some specified set of unitary operators, acting on 
some specified number of qubits, labeled 1, . . . , m. The first few qubits are considered input 
qubits, which are assumed to be in some basis state initially (i.e., classical input); the rest 
are ancillae, each assumed to be in the |0) state initially. Thus the initial state of the qubits 
is \x, 00 • • • 0), for some binary string x. Some arbitrary set of qubits are specified as output 
qubits, and these qubits are measured in the computational basis at the final state. We 
assume that the sets of input and output qubits are part of the description of the circuit. 
The circuit accepts its input if all the output qubits are observed to be in the final state. 
Otherwise the circuit rejects. We let Pr[C(x)] denote the probability that C accepts input 
x. 

If C is any quantum circuit, it will be convenient for us to define \C\, the size of C, to be 
the number of output qubits plus the number of "contact points" between qubits and gates, 
so for example, a single-qubit gate counts one towards the size, while a two-qubit gate counts 
two, etc. C may be laid out by partitioning its gates into layers 1, . . . , d, such that (i) gates 
in the same layer all act on pairwise disjoint sets of qubits, and (ii) all gates in layer i are 
applied before any gates in layer % + 1, for 1 < i < d. The depth of C is then the smallest 
possible value of d. The width of C is the number of qubits in C. 

The standard quantum complexity classes can be defined in terms of quantum circuit 
families. A quantum circuit family is a sequence {C n }„> of quantum circuits, where each 
C n has n inputs. We say that {C n } is uniform if there is a (classical) polynomial-time 
algorithm that outputs a description of C n on input n . 

Definition 2.1 ( |BV97L IBBBV97L IXDH97j ^ Let L be a language. 

• i 6 EQP iff there is a uniform quantum circuit family {C n } such that, for all x of 
length n, 

x EL Pr[C n (x)] = 1, 

x <£ L =^ Pr[C n (a;)] = 0. 

• L G BQP iff there is a uniform quantum circuit family {C n } such that, for all x of 
length n, 

x EL Pr[C n (x)] > 2/3, 

x L Pr[C n (a?)] < 1/3. 
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L G NQP iff there is a uniform quantum circuit family {C n } such that, for all x of 
length n, 



xeL =^ Pi[C n (x)} > 0, 
x £ L Pr[C n (a:)] = 0. 

It is known that P C EQP C BQP. It was shown in |FGHP991 IYY99j that NQP = 
C^P, and is thus hard for the polynomial hierarchy. 

2.2 Complexity classes using QNC circuits 

The circuit class QNC was first suggested by Moore and Nilsson |MN02| as the quantum 
analog of the class NC of bounded fan-in Boolean circuits with polylogarithmic depth and 
polynomial size. We define the class QNC fe in the same fashion as definitions in Green, 
Homer, Moore, & Pollett [GHMP02 with some minor modifications. 

Definition 2.2 (|MN02J) QNC fc is the class of quantum circuit families {C n }n>o for 
which there exists a polynomial p such that each C n contains n input qubits and at most p(n) 
many ancillce. Each C n has depth 0(\og k n) and uses only single-qubit gates and CNOT 
gates. The single-qubit gates must be from a fixed finite set. 

Next we define the language classes NQNC fe and EQNC fc . These are QNC fc analogs of 
the classes NQP and EQP, respectively. 

Definition 2.3 ([GHMP02J) Let k > be an integer. 

• NQNC fc is the class of languages L such that there is a uniform {C n } G QNC fc such 
that, for all x, 

iGL ^ Pi[C lxl (x)] > 0. 

• EQNC fc is the class of languages L such that there is a uniform {C n } G QNC fc such 
that, for all x, Pi[C\ x \(x)] G {0, 1} and 

i6i ^ Pr[C N (x)] = 1. 

Remark. Green, Homer, Moore, & Pollett implicitly consider the output qubits of C n to 
be all the qubits in C n |GHMP02] . In our model we allow any subset of qubits to be the 
output qubits of C n , and we do not restrict our circuits to be clean, i.e., the non-output 
qubits could end up in an arbitrary state, possibly entangled with the output qubits. The 
reason we define our circuits this way is based on the observation that, in their model, if a 
language L is in EQNC fc (or BQNC^ for large enough 5), then L can contain no more 
than one string of each length. 

Bounded-error QAC fe classes were mentioned in jGHMP02] . and one can certainly ask 
about similar classes for QNC fc circuits. It is not obvious that there is one robust definition 
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of BQNC — perhaps because it is not clear how to reduce error significantly by amplification 
in constant depth. 2 In the next definition, we will try to be as general as possible while still 
maintaining our assumption that is the only accepting output. 

Definition 2.4 Let e and 5 be functions mapping (descriptions of) quantum circuits into 
real numbers such that, for all quantum circuits C , < e(C) < b~{C) < 1. We write ec and 
5c to denote e(C) and 5(C), respectively. BQNC^ is the class of languages L such that 
there is a uniform {C n } G QNC fc such that for any string x of length n, 

x eL Pr[C n (x)} > 5 Cn , 

x£L Pr[C n (x)] < e Cn . 

An interesting special case is when ec = 5c = 1, that is, the input is accepted iff the 
circuit accepts with probability 1, and there is no promise on the acceptance probability. 
One might expect that, by the symmetry of the definitions, this class BQNC*! 1 1 is the same 
as NQNC , but it is almost certainly not, as we will see. 

2.3 Other classes of constant-depth quantum circuits 
Definition 2.5 Let k > and q > 1 be integers. 

• QAC fe is the same as QNC fe except that generalized Toffoli gates are allowed in the 
circuits. 

• QACC(g) is the same as QNC° except that Mod g gates are allowed in the circuits. 

• QACC = U g>1 QACC(g). 

3 Main results 

3.1 Simulating QNC° circuits exactly is hard 
Theorem 3.1 NQNC = NQP = C^P. 

As a corollary, we essentially solve an open problem of Green et al. [HHMP02 . They con- 
jectured that NQACC C TC°, the class of constant-depth Boolean circuits with threshold 
gates. 

Corollary 3.2 For any k > 0, 

NQNC = NQNC fc = NQAC fc = NQACC = C^P. 

Thus, NQACC % TC° unless C^P = TC°. 

2 One can always reduce error classically by just running the circuit several times on the same input. 
In this case, the best definition of BQNC may be that the gap between the allowed accept and reject 
probabilities should be at least 1/poly. 
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Figure 1: The nonadaptive teleportation module |TD02j . The state in qubit q is teleported 
correctly iff the qubits t\ and r 2 are both observed to be 0. 



Let B be the two-qubit Bell gate, defined as 
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which produces the EPR state (|00) + \ll))/V2. We prove the following lemma, from which 
the theorem follows quickly. 

Lemma 3.3 For any quantum circuit C using gates drawn from any family J 7 , there is a 
depth-three quantum circuit C of size linear in \C\ using gates drawn from JFU {B, B^} such 
that for any input x of the appropriate length, 

Pr[C'(x)\ = 2- m Pr[C(a;)], 

for some m < 2\C\ depending only on C. The middle layer of C contains each gate in 
C exactly once and no others. The third layer contains only B' -gates, and the first layer 
contains only B-gates, which are used only to create EPR states. 

Proof. Our construction is a simplified version of the main construction in Terhal & Di- 
Vincenzo |TD02| . but ours is stronger in one crucial respect discussed below: it does not 
significantly increase the family of gates used. To construct C, we start with C and simply 
insert, for each qubit q of C, a simplified teleportation module (shown in Figure between 
any two consecutive quantum gates of C acting on q. No further gates involve the qubits 
ri and r 2 to the right of the S^-gate. This module, which lacks the usual corrective Pauli 
gates, is a nonadaptive version of the standard single-qubit teleportation circuit BBC + 93j . 
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It faithfully teleports the state if and only if the observed output of the B^-gate on the 
right is 00. After inserting each teleportation circuit, the gates acting before and after it are 
now acting on different qubits. Further, it is important to note that any entanglement the 
qubit state has with other qubits is easily seen to be preserved in the teleported qubit. The 
input qubits of C are those of C. The output qubits of C are of two kinds: output qubits 
corresponding to outputs of C are the original outputs; the other outputs are the qubits (in 
pairs) coming from the added -B^-gates. We'll call the measurement of each such pair a Bell 
measurement, even though it is really in the computational basis. 

In addition to the gates in C, C uses only 5-gates to make the initial EPR pairs and 
.B^-gates for the Bell measurements. A sample transformation is shown in Figure El C has 
depth three since it uses the first layer to make the initial EPR states and the third layer 
to rotate the Bell basis back to the computational basis. All the gates of C appear on the 
second layer. From the above constuction and the properties of the teleportation module, it 
is not hard to see that for all x of the appropriate length, 

Pr[C(x)] = Pr[all original outputs of C are | all qubit states are teleported correctly] 
= Pr[all original outputs of are | all Bell measurement results are 00] 

Pr[C'(x)] 

Pr [all Bell measurement results are 00] ' 

since the Bell measurements are among the output measurements of C. Let k be the number 
of 5^-gates on layer 3. Clearly, k < \C\, and it is well-known that each Bell measurement will 
give 00 with probability 1/4, independent of all other measurements. So the lemma follows 
by setting m = 2k. □ 

Proof of Theorem 13.11 As mentioned before, NQP ADH97, is defined as the class of 
languages recognized by quantum Turing machines (equivalently, uniform quantum circuit 
families over a finite set of gates) where the acceptance criterion is that the accepting state 
appear with nonzero probability. It is known |FGHP99"1 IYY99j that NQP = C^P, which 
contains NP and is hard for the polynomial hierarchy. Since QNC° circuit families must 
also draw their gates from some finite set, we clearly have NQNC C NQP. The reverse 
containment follows from our construction: an arbitrary circuit C is transformed into a 
depth-three circuit C with the same gates as C plus B and B'. Moreover, C accepts with 
nonzero probability iff C does. Thus an NQP language L recognized by a uniform quantum 
circuit family over a finite set of quantum gates is also recognized by a uniform depth-three 
circuit family over a finite set of quantum gates, and so L G NQNC . □ 

Using the gate teleportation apparatus of Gottesmann and Chuang |GC99j . Terhal & 
DiVincenzo also construct a depth-three 3 quantum circuit C out of an arbitrary circuit C 
(over CNOT and single-qubit gates) with a similar relationship of acceptance probabilities. 
However, they only teleport the CNOT gate, and their C may contain single-qubit gates 

3 They count the depth as four, but they include the final measurement as an additional layer whereas we 
do not. 
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Figure 2: A sample transformation from C to C. The circuit C on the left has five gates: S, 
T, U, V, and W, with subscripts added to mark which qubits each gate is applied to. The 
qubits in C are numbered corresponding to those in C. 
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formed by compositions of arbitrary numbers of single-qubit gates from C. (Such gates may 
not even be approximable in constant depth by circuits over a fixed finite family of gates.) 
When their construction is applied to each circuit in a uniform family, the resulting circuits 
are thus not generally over a finite gate set, even if the original circuits were. 

Our construction solves this problem by teleporting every qubit state in between all gates 
involving it. Besides B and B>, we only use the gates of the original circuit. We also are able 
to bypass the CNOT gate teleportation technique of |GC99j . using instead basic single-qubit 
teleportation |BBC + 93] . which works with arbitrary gates. 

3.2 Simulating QNC° circuits approximately is easy 

In this section we prove that BQNC° 5 C P for certain e, 5. For convenience we will as- 
sume that all gates used in quantum circuits are either one- or two-qubit gates that have 
"reasonable" matrix elements — algebraic numbers, for instance. Our results can apply more 
broadly, but they will then require greater care to prove. 

For a quantum circuit C, we define a dependency graph over the set of its output qubits. 

Definition 3.4 Let C be a quantum circuit and let p and q be qubits of C. We say that 
q depends on p if there is a forward path in C starting at p before the first layer, possibly 
passing through gates, and ending at q after the last layer. More formally, we can define 
dependence by induction on the depth of C. For depth zero, q depends on p iff q = p. For 
depth d > 0, let C be the same as C but missing the first layer. Then q depends on p (in C) 
iff there is a qubit r such that q depends on r (in C) and either p = r or there is a gate on 
the first layer of C that involves both p and r. 

Definition 3.5 For C a quantum circuit and q a qubit ofC, define 

D q = {p | q depends on p}. 

If S is a set of qubits of C, define D s = [j qeS Dg. Let the dependency graph of C be the 
undirected graph with the output qubits of C as vertices, and with an edge between two qubits 
qi and q 2 iff D qi fl^/t). 

If C has depth d, then it is easy to see that the degree of its dependency graph is less 
than 2 2d . The following lemma is straightforward. 

Lemma 3.6 Let C be a quantum circuit and let S and T be sets of output qubits of C. Fix 
an input x and bit vectors u and v with lengths equal to the sizes of S and T , respectively. 
Let Es= u (respectively Et= v ) be the event that the qubits in S (respectively T) are observed 
to be in the state u (respectively v) in the final state of C on input x. If Ds D = 0, then 
E s=u and E T=V are independent. 

For an algebraic number a, we let ||a|| be the size of some reasonable representation of a. 
The results in this section follow from the next theorem. 
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Theorem 3.7 There is a deterministic decision algorithm A which takes as input 

1. a quantum circuit C with depth d and n input qubits, 

2. a binary string x of length n, and 

3. an algebraic number t 6 [0, 1], 

and behaves as follows: Let D be one plus the degree of the dependency graph of C. A runs 
in time Poly(|C|, 2 2 , ||t||), and 

• ifPv[C(x)] > 1 — t, then A accepts, and 

• ifPr[C(x)] < 1 — Dt, then A rejects. 

Note that since D < 2 2d , if t < 2~ 2d , then A will reject when Pr[C(x)] < 1 - 2 2d t. 
Proof of Theorem 13.71 On input (C, x, t) as above, 

1. A computes the dependency graph G = (V,E) of C and its degree, and sets D to be 
the degree plus one. 

2. A finds a D-coloring c : V —> {1, . . . , D} of G via a standard greedy algorithm. 

3. For each output qubit q G V, A computes P q — the probability that is measured on 
qubit q in the final state (given input x). 

4. For each color i £ {1, . . . , D}, let Bi = {q e V \ c(q) = i}. A computes 

p* = n p i> 

q<=Bi 

which by Lemma 13.61 is the probability that all qubits colored % are observed to be 
in the final state. 

5. If > 1 — t for all i, the A accepts; otherwise, A rejects. 

We first show that A is correct. If Pr[C(x)] > 1 — t, then for each i E {1, . . . , D}, 

l-t<Pr[C(x)] <P Bi , 
and so A accepts. On the other hand, if Pr[C(x)] < 1 — Dt, then 

D 



Dt < 1 - Pi[C(x)] < (1 - p Bi) 



so there must exist an i such that t < 1 — Pg. , and thus A rejects. 

To show that A runs in the given time, first we show that the measurement statistics of 
any output qubit can be calculated in time polynomial in 2 2 . Pick an output qubit q. By 
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looking at C we can find D q in time Poly(|C|). Since C has depth d and uses gates on at most 
two qubits each, D q had cardinality at most 2 d . Then we simply calculate the measurement 
statistics of output qubit q from the input state restricted to D q , i.e., with the other qubits 
traced out. This can be done by computing the state layer by layer, starting with layer one, 
and at each layer tracing out qubits when they no longer can reach q. Because of the partial 
traces, the state will in general be a mixed state so we maintain it as a density operator. We 
are multiplying matrices of size at most 2 2 x 2 2 at most 0(d) times. All this will take time 
polynomial in 2 2 , provided we can show that the individual field operations on the matrix 
elements do not take too long. 

Since there are finitely many gates to choose from, their (algebraic) matrix elements 
generate a field extension F of Q with finite index r. We can thus store values in F as r-tuples 
of rational numbers, with the field operations of F taking polynomial time. Furthermore, 
one can show that for a,b E F, \\ab\\ = 0(||a|| + ||6||) and 11X^=1 a «ll = 0(n • maxj ||aj||) for 
any ax, . . . , a n e F. A bit of calculation then shows that the intermediate representations of 
numbers do not get too large. 

The dependency graph and its coloring can clearly be computed in time Poly ( |C| ) . The 
only things left are the computation of the Pb. and their comparison with 1 — t. For reasons 
similar to those above for matrix multiplication, this can be done in time Poly ( | C | , 2 2 , ||i||). 
□ 

Corollary 3.8 Suppose e and 5 are polynomial-time computable, and for any quantum cir- 
cuit C of depth d, 5 C = 1 - 2- 2d (l - e c ). Then 

BQNC° 5 C P. 

Proof. For each C of depth d in the circuit family and each input x, apply the algorithm A 
of Theorem O with t = 1 - 5 C = 2" 2d (l - e c ), noting that D < 2 2d . □ 

The following few corollaries are instances of Corollary 13.81 

Corollary 3.9 For quantum circuit C, let 5c = 1 — 2~^ 2d+l \ where d is the depth of C . 
Then 

BQNC° 1/2M C P. 

Proof. Apply algorithm A to each circuit, setting t = 2~( 2d+1 \ □ 
Corollary 3.10 BQNC^ C P. 

Proof. Apply algorithm A to each circuit, setting t = 0. □ 
Corollary 3.11 EQNC C P. 
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Proof. Clearly, EQNC° C BQNC^. □ 

Corollaries I3.1UI and 13.111 can actually be proven more directly. We simply compute, for 
each output, its probability of being 0. We accept iff all probabilities are 1. 

We observe here that by a simple proof using our techniques, one can show that the 
generalized Toffoli gate cannot be simulated by a QNC° circuit, since the target of the 
Toffoli gate can only depend on constantly may input qubits. 

4 Conclusions, open questions, and further research 

Our upper bound results in Section 13.21 can be improved in certain ways. For example, the 
containment in P is easily seen to apply to (log log n + 0(l))-depth circuits as well. Can we 
increase the depth further? Another improvement would be to put BQNC° 5 into classes 
smaller than P. LOGSPACE seems managable. How about NC 1 ? 

There are some general questions about whether we have the "right" definitions for these 
classes. For example, the accepting outcome is defined to be all outputs being 0. One 
can imagine more general accepting conditions, such as arbitrary classical polynomial-time 
postprocessing. If we allow this, then all our classes will obviously contain P. If we allow 
arbitrary classical polynomial-time preprocessing, then all our classes will be closed under 
ptime m-reductions (Karp reductions). 

Finally, there is the question of the probability gap in the definitions of BQNC . Cer- 
tainly we would like to narrow this gap (ideally, to 1/poly) and still get containment in 
P. 
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